Biologically-Based Interactive Neural Network Models for Visual Attention and Object Recognition
نویسنده
چکیده
The main focus of this thesis is to develop biologically-based computationalmodels for object recognition. A series of models for attention and objectrecognition were developed in order of increasing functionality and complex-ity. These models are based on information processing in the primate brain,and especially inspired from the theory that visual information processingoccurs along two parallel processing pathways in the primate's visual cortex,the ventral pathway and the dorsal pathway. To capture the true essence ofincremental, constraint satisfaction processing in the visual system, interac-tive neural networks were used for implementing our models. Results fromeye-tracking studies on the relevant visual tasks, as well as our hypothesisregarding information processing in the primate visual system, were imple-mented in the models and tested with simulations. As a rst step, a model based on the ventral pathway was developed torecognize single objects. Through systematic testing, structural and algo-rithmic parameters of this model were ne tuned for performing its taskoptimally. In the second step, the model was extended by considering thedorsal pathway, which enables simulation of visual attention as an emergentphenomenon. The extended model was then investigated for visual searchtasks, where one object is to be identi ed among other objects. In the laststep, we focussed on occluded and overlapped object recognition. The modelwas further advanced on the lines of the presented hypothesis, and simulatedon the tasks of occluded and overlapped object recognition. On the basis of the results and analysis of our simulations we have found thatthe generalization performance of interactive hierarchical networks improveswith the addition of a small amount of Hebbian learning to an otherwise pureerror-driven learning. We also concluded that the size of the receptive eldin our networks is an important parameter for the generalization task anddepends on the object of interest in the image. Our results also show thatnetworks using hard coded feature extraction perform better than the net-works that use Hebbian learning for developing feature detectors. We havesuccessfully demonstrated the emergence of visual attention within an inter-active network and also the role of context in the search task. Simulation
منابع مشابه
Exploring Biologically-Inspired Interactive Networks for Object Recognition
This thesis deals with biologically-inspired interactive neural networks used for the task of object recognition. Such networks offer an interesting alternative approach to traditional image processing techniques. Although the networks are very powerful classification tools, they are difficult to handle due to their bidirectional interactivity. This is one of the main reasons why these networks...
متن کاملMachine learning based Visual Evoked Potential (VEP) Signals Recognition
Introduction: Visual evoked potentials contain certain diagnostic information which have proved to be of importance in the visual systems functional integrity. Due to substantial decrease of amplitude in extra macular stimulation in commonly used pattern VEPs, differentiating normal and abnormal signals can prove to be quite an obstacle. Due to developments of use of machine l...
متن کاملObject recognition with hierarchical discriminant saliency networks
The benefits of integrating attention and object recognition are investigated. While attention is frequently modeled as a pre-processor for recognition, we investigate the hypothesis that attention is an intrinsic component of recognition and vice-versa. This hypothesis is tested with a recognition model, the hierarchical discriminant saliency network (HDSN), whose layers are top-down saliency ...
متن کاملSynchronisation - Based Computational Model of Attention - Guided Object Selection and Novelty Detection
We develop a new biologically inspired oscillatory model that combines consecutive selection of objects and discrimination between new and familiar objects. The model works with visual information and fulfils the following operations: (1) separation of different objects according to their spatial connectivity; (2) consecutive selection of objects located in the visual field into the attention f...
متن کاملAircraft Visual Identification by Neural Networks
In the present paper, an efficient method for three dimensional aircraft pattern recognition is introduced. In this method, a set of simple area based features extracted from silhouette of aerial vehicles are used to recognize an aircraft type from its optical or infrared images taken by a CCD camera or a FLIR sensor. These images can be taken from any direction and distance relative to the fly...
متن کامل